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DETAILED ACTION 

Remarks 

1 . In response to communications filed on 20 March 2007, claims 1 , 3, 6, 10, and 
14 are amended and claim 8 is cancelled. Claims 1-7 and 9-16 are pending in the 
application. 

Claim Objections 

2. Claim 1 3 is objected to because of the following informalities: the claim contains 
the limitation "said frequency of the word", yet in the parent claims, only a "number of 
occurrences" is being determined. 

3. Claim 14 is objected to because of the following informalities: the third limitation 
states "determining whether the word is unnecessary for identifying the given category 
on the basis of the recognized frequency". However, because of the amendment, there 
is now a recognized "number of occurrences" rather than a recognized "frequency". 
Appropriate correction is required. 



Claim Rejections - 35 USC § 103 

4. The following is a quotation of 35 U.S. C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 
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5. Claims 1-7, 10-12, and 14-16 rejected under 35 U.S.C. 103(a) as being 
unpatentable over Esposito et al . ("A Machine Learning Approach to Web Mining") in 
view of Liu et al . (US Pre-Grant Publication 2003/0154181). 

As to claim 1 , Esposito et al . teaches a document automatic classification 
system, comprising: 

List generation means for generating a word list for each of at least two 
categories by extracting words from a learning document set (see page 192, section 3, 
paragraph 1 and Figure 1 ); 

unnecessary word determination means for relatively determining an 
unnecessary word for a category (see page 193, paragraph 3) 

Esposito et al . does not explicitly teach on the basis of a number of occurrences 
of a given word within at least one other category by using the list generated by said list 
generation means; and 

Liu et al . teaches on the basis of a number of occurrences of a given word within 
at least one other category by using the list generated by said list generation means 
(see paragraphs [0070]-[0075]. "discriminative features are those that occur more 
frequently inside a particular cluster than outside that cluster, whereas non-disciminative 
features are those that have similar occurrences frequencies among all the clusters. 
What the metric DFM(/i) reflects is exactly this disparity in occurrences frequencies of 
feature f/ among different clusters. In other words, the more discriminative the feature f lt 
the larger value the metric DFM(/j) takes", paragraph [0074]). 
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Esposito et al . as modified teaches: 

Means for generating a document classification catalog by eliminating words 
determined to be unnecessary words from each of the word lists (see Esposito et al . 
page 193, paragraph 3. The first 'N' words per dictionary are chosen as representative 
words. The others are unnecessary, and are not included in the list of representative 
words. Therefore, they are eliminated. Also see Esposito et al .. Table 1 , page 194). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to have modified Esposito et al . by the teaching of Liu et 
a]., since Liu et al . teaches that "the document clustering method is autonomous, 
unsupervised, and performs document clustering without the requirement of domain- 
dependent background information, nor predefined document categories or a given list 
of topics" (see paragraph [0037]). 

As to claim 2, Esposito et al . as modified teaches wherein said list generation 
means generates a list indicating a frequency of appearance of a given word for each 
category (see page 193, paragraph 2, "class dictionary", and Figure 1, Class 
Dictionaries. Also see Liu et al .. paragraph [0072], equation 12. A list including the 
frequency of occurrence for each feature in each class is determined). 

As to claim 3, Eoosito et al . as modified teaches wherein said unnecessary word 
determination means extracts a word belonging to a given category and determines it to 
be an unnecessary word in response to the word having a greater number of 
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occurrences in another category than is allowed by a given standard (see Liu et al . 
paragraph [0074], and equations 11, 12, and 13 in paragraphs [0072]-[0073]. The most 
frequent class that a feature word is determined, and compared with the frequency of 
the other classes. A feature word is chosen not to be discriminating if it occurs too often 
in other classes. "In an illustrative embodiment, discriminative features are defined as 
those whose DFM values exceed the predefined threshold T df .). 

As to claim 4, Esposito et al . as modified teaches wherein the given standard is 
determined according to a predetermined threshold (see Liu et al . paragraph [0074]). 

As to claim 5, Esposito et al . as modified teaches further comprising: 
Document classification means for performing classification processing for 

classification target documents by using said document classification catalog (see 

Esposito et al . page 194, section 4, paragraph 1 ). 

As to claim 6, Esposito et al . as modified teaches a document automatic 
classification system, comprising: 

A classified document set storage device for storing documents classified 
according to at least two categories (see page 191, section 2, and page 194, Table 1); 

A category table generation unit for generating a table (see page 193, 
paragraphs 1-2, "Class Dictionary", and Figure 1), the table comprising: 
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Word lists corresponding to each of the at least two categories wherein the word 
lists are generated by extracting words from a learning document set (see page 193, 
paragraphs 1-2, "Class Dictionary" Figure 1, and Table 1); and 

Frequencies comprising a number of occurrences of each extracted word within 
the learning document set (see page 193, first bullet, "MaxTF(l, t), the maximum value 
of TF(I, j, t) on al training documents of class I"); 

An unnecessary word elimination unit for eliminating an unnecessary word from a 
category in the table (se page 193, paragraph 3) 

Esposito et al . does not teach on the basis of a number of occurrences within at 
least one other category of a given word, 

Liu et al . teaches on the basis of a number of occurrences within at least one 
other category of a given word (see Liu et al . paragraphs [0070]-[0075]); 

Esposito et al . as modified teaches wherein said unnecessary word elimination 
unit extracts a word belonging to a given category and eliminates the word as an 
unnecessary word from said table in response to the word appearing more frequently in 
another category than is allowed by a given standard (see Esposito et al . page 193, 
paragraph 3, and Liu et al . paragraphs [0074]); 

A classification catalog storage device for storing the table from which the 
unnecessary word was eliminated by said unnecessary word elimination unit (see page 
193, section 3. Also see Esposito etal .. Table 1 , page 194). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to have modified Esposito et al . by the teaching of Liu et 
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al, since Liu etal . teaches that "the document clustering method is autonomous, 
unsupervised, and performs document clustering without the requirement of domain- 
dependent background information, nor predefined document categories or a given list 
of topics" (see paragraph [0037]). 

As to claim 7, Esposito et al . as modified teaches: 

A classification target document storage device for storing classification target 
documents to be classified (see page 197, last paragraph); and 

A document classification processing unit for performing classification processing 
for the classification target documents stored in said classification target document 
storage device by using said table stored in said classification catalog storage device 
(see paragraph [0197]). 

As to claim 10, Esposito et al . teaches an unnecessary word determination 
method in a document automatic classification system, comprising the steps of: 

Generating a word list for each of at least two categories by extracting words 
from a learning document set (see page 192, section 3, paragraph 1), the word list 
containing information on a frequency of appearance of each extracted word within 
each category (see page 193, paragraphs 1-2, "Class Dictionary", Figure 1, and Table 

D; 

Determining an unnecessary word for a category (see page 193) 
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Esposito et al . does not explicitly teaches on the basis of a relative number of 
occurrences of a given word within at least one other category; 

Liu et al . teaches on the basis of a relative number of occurrences of a given 
word within at least one other category (see Liu et al . paragraphs [0070]-[0075]); 

Esposito et al . as modified teaches: 

Eliminating words determined to be unnecessary words from each of the word 
lists (see Esposito et al . page 1 93, paragraph 3). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to have modified Esposito et al . by the teaching of Liu et 
al., since Liu et al . teaches that "the document clustering method is autonomous, 
unsupervised, and performs document clustering without the requirement of domain- 
dependent background information, nor predefined document categories or a given list 
of topics" (see paragraph [0037]). 

As to claim 1 1 , Esposito et al . as modified teaches wherein, in said step of 
determining the unnecessary word, the unnecessary word is determined according to 
whether one word selected from the given category appears in said other categories 
more frequently than is allowed by a given standard (see Liu et al . paragraph [0074], 
and equations 11,12, and 13 in paragraphs [0072]-[0073] and the arguments in regards 
to claim 3). 
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As to claim 12, Esposito et al . as modified teaches wherein said given standard 
is a value obtained from a predetermined given threshold (see Liu et al . paragraph 
[0074]). 

As to claim 14, Esposito et al . teaches an unnecessary word determination 
method in a document automatic classification system, comprising the steps of: 

Acquiring information on words from a document set, classifying the words 
according to category, and storing the words in a storage device (see page 193, 
paragraphs 1-2, "Class Dictionary", Figure 1, and Table 1); 

Esposito et al . does not explicitly teach recognizing a number of occurrences 
within at least one other category of a word belonging to a given category on the basis 
of the acquired information; 

Liu et al . teaches recognizing a number of occurrences within at least one other 
category of a word belonging to a given category on the basis of the acquired 
information (see Liu et al . paragraphs [0070]-[0075]); 

Esposito et al . as modified teaches determining whether the word is unnecessary 
for identifying the given category on the basis of the recognized frequency (see 
Esposito et al . page 193, paragraph 3); and 

Generating a document classification catalog by eliminating words determined to 
be unnecessary words (see Esposito et al . page 193, paragraph 3). 
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As to claim 15, Esposito et al . teaches further comprising storing said 
classification catalog into the storage device (see page 193, paragraph 3). 

As to claim 16, Esposito et al . as modified teaches further comprising the step of 
performing classification processing for classification target documents by using the 
classification catalog stored in said storage device (see page 197, last paragraph). 

6. Claim 9 is rejected under 35 U.S.C. 103(a) as being unpatentable over Esposito 
etal . ("A Machine Learning Approach to Web Mining") in view of Liu et al . (US Pre- 
Grant Publication 2003/0154181), and further in view of Grasso et al . (US Pre-Grant 
Publication 2004/0254911). 

Esposito et al . as modified teaches the system according to claim 6. 

Esposito et al . as modified teaches wherein said table contains information on 
each word, a frequency of appearance of each word (see page 193, "Class Dictionary", 
Figure 1, and Table 1), 

Esposito et al . as modified does not teach and a part of speech of each word. 

Grasso et al . teaches and a part of speech of each word (see paragraph [0037]). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to have modified Esposito et al . by the teaching of Grasso 
et al .. since Grasso et al . teaches that "using this information, it is possible to determine 
whether a word is occurring with above average frequency in a specific text compared 
with how frequently it appears on average" (see paragraph [0039]). 
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7. Claim 13 is rejected under 35 U.S.C. 103(a) as being unpatentable over Esposito 
etal. ("A Machine Learning Approach to Web Mining") in view of Liu et al . (US Pre- 
Grant Publication 2003/01 54181 ), and further in view of Mohan et al . (US Patent 
6,970,881). 

Esposito et al . as modified teaches wherein said given standard is determined 
according to said frequency of all words in said other categories (see page 193, section 
3, paragraph 3, and Liu et al . paragraphs [0070]-[0075]) 

Esposito et al . as modified doesn't teach and a total frequency of all words in 
said other categories. 

Mohan et al . teaches and a total frequency of all words in said other categories 
(see 8:48-50). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to have further modified Esposito et al . by the teaching of 
Mohan et al .. since Mohan et al . teaches that "Concepts having probabilities within a 
certain range are selected as key concepts to represent the theme, or meaning, of an 
object. By setting the range, it is possible to dramatically increase precision and recall 
for objects classification" (see 3:43-47). 

Response to Arguments 

8. Applicant's arguments with respect to claims 1-7 and 9-16 have been considered 
but are moot in view of the new ground(s) of rejection. 
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Conclusion 



9. 



Any inquiry concerning this communication or earlier communications from the 



examiner should be directed to Charles D. Adams whose telephone number is (571) 
272-3938. The examiner can normally be reached on 8:30 AM - 5:00 PM, M - F. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Charles Rones can be reached on (571) 272-4085. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 



Charles Adams 
AU2164 





